NEAR-OPTIMAL MULTIPLE-INPUT MULTIPLE-OUTPUT (MIMO) CHANNEL 
DETECTION VIA SEQUENTIAL MONTE CARLO 



CROSS-REFERENCE TO RELATED APPLICATIONS 

[0001] This application claims the benefit of U.S. provisional patent application 
no. 60/450,777, filed Febraary 28, 2003 (Docket No. NEC02007/ 16792), incorporated 
herein by reference. 

BACKGROUND OF THE INVENTION 
Field of Invention 

[0002] The invention relates generally to digital data receivers. 
Description of Related Art 

[0003] The ever-increasing demand for high-speed wireless data transmission 
has posed great challenges for wireless system designers to achieve high-throughput 
wireless communications in radio channels with limited bandwidth. Multiple transmit and 
receive antennas are most likely to be the dominant solution in future broadband wireless 
communication systems, as the capacity of such a multiple-input multiple-output (MIMO) 
channel increases linearly with the minimum between the numbers of transmit and receive 
antennas in a rich-scattering environment, without increasing the bandwidth or transmitted 
power [1,2, 3]. See the Appendix for references. Because of the extremely high spectrum 
efficiency, MIMO techniques have been incorporated into several standards of various 
wireless applications, such as the IEEE 802.1 la wireless LAN, the IEEE 802.16 wireless 
MAN, and the WCDMA standards. 
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[0004] The Bell-labs layered space-time (BLAST) architecture is an example of 
uncoded MIMO systems now under implementation. In the literature [4, 5, 6], different 
BLAST detection schemes have been proposed based on nulling and interference 
cancellation (IC), such as the method of zero-forcing (ZF) nulling and IC with ordering, 
and the method based on minimum mean-squared error (MMSE) nulling and IC with 
ordering. The performance of these simple detection strategies is significantly inferior to 
that of the maximum likelihood (ML) detection, whose complexity grows exponential in 
terms of the number of transmit antennas. In [7, 8, 9], sphere decoding is proposed as a 
near-optimal BLAST detection method, whose complexity is cubic in terms of the number 
of transmit antennas. As hard decision algorithms, the above schemes suffer performance 
losses when concatenated with outer channel decoder in coded MIMO systems. In [8], a 
list sphere decoding algorithm is proposed to yield sofl-decision output by storing a list of 
symbol sequence candidates. However, the complexity is significantly increased compared 
with the original sphere decoding algorithm. 

[0005] The present invention provides a new family of demodulation 
algorithms based on sequential Monte Carlo methods. The new algorithms may 
advantageously be used for soft MIMO demodulation and achieve near-optimal 
performance with low complexities. 

[0006] The sequential Monte Carlo (SMC) methodology [10, 1 1, 12, 13, 14, 
15] originally emerged in the field of statistics and engineering and has provided a 
promising new paradigm for the design of low complexity signal processing algorithms 
with performance approaching the theoretical optimimi for fast and reliable communication 
in highly severe and dynamic wireless environments. The SMC can be loosely defined as a 
class of methods for solving online estimation problems in dynamic systems, by 
recursively generating Monte Carlo samples of the state variables or some other latent 
variables. In [10, 1 1, 12, 15], SMC has been successfully applied to a number of problems 
in wireless communications including channel equalization, joint data detection and 
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channel tracking in fading channels, and adaptive OFDM receiver in time dispersive 
channels. 

BRIEF SUMMARY OF THE INVENTION 
[0007] The present inventor has recognized that an SMC detector may 
advantageously be used as the soft-input soft-output demodulator in a turbo receiver. In 
particular, the present invention encompasses a new class of detection schemes that may 
advantageously be used for soft-input soft-output MIMO detection for quasi-static MIMO 
fading channels. 

[0008] This novel class of receivers is based on the SMC framework and can 
have either stochastic or deterministic settings. The stochastic SMC demodulator employs 
the techniques of importance sampling and resampling, where the trial sampling 
distribution is formed by exploiting the artificial sequential structure of the existing simple 
nulling and cancellation BLAST detection scheme. The deterministic SMC demodulator 
recursively performs exploration and selection steps to detect the final survivor signal 
paths with the corresponding importance weights. Being soft-input and soft-output in 
nature, these SMC detectors can serve as the soft demodulator in a turbo receiver for coded 
MIMO systems, where the extrinsic information is iteratively exchanged between the soft 
demodulator and the soft outer channel decoder to successively improve the receiver 
performance. Moreover, when the channel is unknown and is estimated by using pilot 
symbols, the decoded code bits with high reliability can act as pilots for chaimel re- 
estimation. By performing iterative channel estimation and data detection, the receiver 
performance can be fiirther improved. The invention can be used in other applications as 
well, such as in decoding linear dispersion MIMO systems. 

[0009] In one aspect of the invention, a method for demodulating data from a 
channel includes receiving a priori probability values for symbols transmitted across the 
channel, in accordance with the a priori probability values, determining a set of Monte 
Carlo samples of the symbols weighted with respect to a probabiUty distribution of the 
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symbols, and estimating a posteriori probability values for the symbols based on the set of 
Monte Carlo samples. 

[0010] A related program storage device and receiver apparatus are also 
provided. 

BRIEF DESCRIPTION OF THE DRAWINGS 

[0011] These and other features, benefits and advantages of the present 
invention will become apparent by reference to the following text and figures, with like 
reference numbers referring to like structures across the views, wherein: 

[0012] Fig. 1 illustrates a transmitter structure of a coded multiple-input, 
multiple-output (MIMO) system; 

[0013] Fig. 2 illustrates a receiver structure of a coded MIMO system; 

[0014] Fig. 3 illustrates a stochastic SMC MIMO demodulation process; 

[0015] Fig. 4 illustrates a deterministic SMC MIMO demodulation process; 

[0016] Fig. 5 illustrates bit error rate (BER) performance of various MIMO 
demodulation algorithms in an uncoded MIMO system; 

[0017] Fig. 6 illustrates BER performance of a turbo MIMO receiver 
employing the stochastic sequential Monte Carlo (SMC) MIMO demodulator, where 
channel estimation is based on pilots only; 

[0018] Fig. 7 illustrates BER performance of a turbo MIMO receiver 
employing the stochastic SMC MIMO demodulator, where iterative channel estimation is 
based on pilots and the decoded symbols with high reliability; 

[0019] Fig. 8 illustrates BER performance of a turbo MIMO receiver 
employing the stochastic SMC MIMO demodulator, where genie-aided channel estimation 
is based on pilots and all information symbols; 

[0020] Fig. 9 illustrates BER performance of a turbo MIMO receiver 
employing the deterministic SMC MIMO demodulator, where channel estimation is based 
on pilots only; 
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[0021] Fig. 10 illustrates BER performance of a turbo MIMO receiver 
employing the deterministic SMC MIMO demodulator, where iterative channel estimation 
is based on pilots and the decoded symbols with high reliability; and 

[0022] Fig. 1 1 illustrates BER performance of a turbo MIMO receiver 
employing the deterministic SMC MIMO demodulator, where genie-aided channel 
estimation is based on pilots and all information symbols. 

DETAILED DESCRIPTION OF THE INVENTION 
[0023] The description is organized as follows. Section 2 describes the system 
under consideration. Background materials on the SMC methodology are provided in 
Section 3. In Section 4, we derive the new class of soft MIMO demodulation algorithms 
using the stochastic SMC and the deterministic SMC, based on the simple nulling and 
cancellations BLAST detection scheme. Computer simulation results are provided in 
Section 5, and conclusions are drawn in Section 6. 
[0024] 2 System Descriptions 

[0025] In this section, we consider a generic coded MIMO system with a turbo 
receiver. The transmitter and receiver structures are shown in Fig. 1 and Fig. 2, 
respectively. In Fig. 2, Tl denotes the interleaver and FT^ denotes the deinterleaver. 

[0026] 2.1 Transmitter Structure 

[0027] At the transmitter 100, a block of information bits {a/} are encoded into 
code bits {A/} at a channel encoder 1 10. The code bits are then randomly interleaved at an 
interleaver 120 and mapped to an M-PSK or M-QAM modulation symbol stream using a 
QPSK modulator 130, taking values from a finite alphabet set^ - {ai, aa, - • , am}- Each 
symbol is serial-to-parallel (S/P) converted at an S/P converter 140 to nj sub-streams via 
demultiplexing, and each sub-stream is associated with one of a number nj of transmit 
antennas 150. At each time instance, one symbol from each sub-stream is transmitted from 
its corresponding antenna, resulting in nr symbols transmitted simultaneously in the same 
frequency band. Such a space-time bit interleaved coded modulation (BICM) allows for a 
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better exploitation of the spatial, temporal and frequency diversity resources available in 
the wireless MIMO systems [16]. 

[0028] We consider a MIMO system with nr transmit and hr receive antennas 
under the BLAST assumption, i.e., assxmiing that riR > nr in order to guarantee that the 
transmitted data symbols can be uniquely decoded at the receiver. The wireless channel is 
assumed to have rich-scattering and flat fading. The fading between each transmit and 
receive antenna pair is assumed to be independent. The channel is also assumed quasi- 
static, i.e., it is static over a data burst and changes from burst to burst. 

[0029] At the receiver 200, after matched filtering and symbol rate sampling at 
the receiver's antennas 205, the received signal vector from all hr receive antennas 205 is 
denoted as 

In complex baseband representation, the received signal can be expressed as the linear 
combination of the transmitted signal ^ ^^^^^ ' ^ ' (*)f 

|^(i) = */^H«(0 + t;(i), i=l^..iV, (1) 

V ^ 

where ^^^^ ^ -^^^ ^ is the total signal energy at the transmitter, ^ ^ *^ " is the 

complex fading channel matrix, ^ ^^""^^^ ^"^""^ is the spatially and temporally white 
Gaussian noise, and iVis the data burst length. 

[0030] Denote ^ ^ ^^H, received signal is matched-filtered and 
whitened, to obtain (here for simplicity, we drop the time index /) 

u^n-^H^y^yf^ n^s + (2) 

where ^ ^''^^^ ^ ^e(0,/«^). ^2), the maximum likelihood (ML) MIMO 

detector is given by: 
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s = QXg rain 



m 



Subsequently, the complexity of the ML receiver grows exponentially in the number of 
transmit antennas. Alternative sub-optimal approaches such as the method based on zero- 
forcing nulling and cancellation with ordering, and the method based on MMSE nulling 
and cancellation with ordering [1,6], have lower complexities but also incur substantial 
performance loss. The sphere decoding method has recently emerged as a near-optimal 
BLAST detection algorithm. However, its complexity is cubic in term of the number of 
transmit antennas [7, 8]. 

[0031] 2.2 Turbo Receiver 

[0032] Since the combination of M-PSK or M-QAM modulation and symbol- 
antenna mapping effectively acts as an inner encoder in the MIMO transmitter, the whole 
system can be perceived as a serial concatenated system and an iterative (turbo) receiver 
[17, 18] can be designed for such a system as shown in Fig. 2. The turbo receiver 200 
includes two stages: the soft-input soft-output SMC demodulator 210 to be developed in 
Section 4 followed by a soft channel decoder 260. The two stages are separated by a 
deinterleaver 250 and an interleaver 230. 

[0033] The soft MIMO demodulator 210 takes as the input the extrinsic 
information h[bi] delivered by the channel decoder 260 in the previous turbo iteration as 
well as the received signals u given by (2). First, the symbol a priori probability is 
calculated at the symbol probability calculator 220 as follows. Assxmiing QPSK 
modulation is employed, the bit pair {fij^i, fija} is mapped to symbol aj . Additionally, 
assume that in the MIMO systems, the transmitted symbol Sk corresponds to the interleaved 
code bit pair {fe;,^;;, bn(k2)), for ^= 1, . . . , . The a priori symbol probability for each 
symbol ay e ^ is then given by 

^ P{sk^ ai) = P (^,(fc4) « fi^x) ^ P ( = . (4) 
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where the code bit probability can be computed from the corresponding log likelihood ratio 
(LLR) as follows [19]: 



i[l+tanh (|exp(A2[^l))]. 



(5) 



Note that initially the extrinsic information is set to be zero so that the QPSK symbols are 
equally probable. 

[0034] At the output of the soft demodulator 2 1 0 are the a posteriori symbol 
probabilities P {sk = a, | w), A: = 1, 2, • • , nrj' =1,2," Denote 



The exact expression of P(sk = aj \ u) is given by 



p{u) 



} 



(6) 



DC 



P(su = aj) exp U L - J^fi^B J llPisi). 



Since the sunMnations in (6) are over all the -^^"^possible vector s in S)Pj = 1, . . . , M, 
its complexity is exponential in the number of transmit antennas and impractical for 
systems with high spatial-multiplexing gain. Here, we develop some low complexity 
algorithms for approximating (6). Based on the symbol a posteriori probabilities computed 
by the soft demodulator 210, the bit LLR computer 240 calcxilates the a posteriori log- 
likelihood ratios (LLRs) of the interleaved code bits b„(i). Assuming that the code bit hn(i) is 
included in a QPSK symbol ^ -^f the LLR of this code bit is given by 

(7) 
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Ai[6^(i)] = log _ — 

L vJJ p^5^^^^^o|„J 

[0035] Using the Bayes' rule, (7) can be written as 

where the second term in (8), denoted by A,2[bn(i)], represents the a priori LLR of the code 

bit b„(i), which is computed by the channel decoder 260 in the previous iteration, 

interleaved at the interleaver 230 and then fed back to the soft MIMO demodulator 210. 

For the first iteration, it is assumed that all code bits are equally likely. The first term in 

(8), denoted by A,i[bn{i)], represents the extrinsic information delivered by the soft MIMO 

demodulator 210, based on the received signals w, the MIMO signal structure, and the a 

priori information about all other code bits. The extrinsic information X.|[brt(i)] is then 

deinterleaved at the deinterleaver 250 and fed back to the channel decoder 260, as the a 

priori information for the channel decoder. The soft decoder 210 takes as input the a 

priori LLRs of the code bits and deHvers as output an update of the LLRs of the coded bits, 

as well as the LLRs of the information bits based on the code constraints. The soft channel 

decoding algorithm [20] computes the a posteriori LLR of each code bit, 

. ^ . Pjbj ^ 1 I code constraints) 

A2[eiJ log p^^^ ^ ^ j ^Xx{bi)}v, code constraints) 

where the factorization (9) is shown in [19]. It is seen fix)m (9) that the output of the soft 
decoder is the sum of the a priori information Ai[6i], and the extrinsic information X2{b\\ 
delivered by the channel decoder 260. This extrinsic information is the information about 
the code bit bi gleaned fi-om the a priori information about the other code bits, {A|[fei]}i;ti, 
based on the constraint structure of the code. Note that at the first iteration, the extrinsic 
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information and { A2[fti]}/ are statistically indq)endent. But subsequently, since 

they use the same information indirectly, they will become more and more correlated and 
finally the improvement through the iterations will diminish. 
[0036] 2.3 MIMO Channel Estimation 

[0037] Since the receiver 200 has no knowledge of channel state information in 
realistic MIMO systems, pilot symbols embedded in the data stream are required to 
estimate the channel impulse response. Assume 7'> w^time slots are used at the beginning 
of each data burst to transmit known pilot symbols 5" = [su 52, • • , 5t ]. Denote the 
corresponding received signal as F s [yi, • • , ]. Then we have 

where V= [v/, • • • , v/^]. In [21], two forms of channel estimators at the channel estimator 
270 are given, namely, the maximum likelihood (ML) estimator, given by 

Aj^L = .fEyS^'iS^r'; (11) 

V «T 

and the minimum mean-square error (MMSE) channel estimator given by 

It is also shown in [21, 22] that the optimal training sequence in the sense of 
minimizing the channel estimation error should satisfy the following orthogonality 
condition 

SS^^T Inr^ (13) 

One way to produce such a training sequence is to use the complex Walsh codes generated 
by the Hadamard matrix. As shown in Fig. 2, at the end of each turbo iteration, the a 
posteriori LLRs of the code bits are fed back from the soft channel decoder 260 through 
the interleaver 280 to the MIMO channel estimator 270 for channel re-estimation. If the 
LLRs of all 2nr code bits corresponding to one QPSK MIMO symbol are above a 
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predetermined threshold, the corresponding decoded MIMO symbol is considered as 
having high-reliability and will act as a training symbol for channel reestimation at the 
beginning of the next turbo iteration. Intuitively, as the turbo receiver 200 iterates, such a 
scheme obtains more and more accurate channel estimation since it makes use of more and 
more training symbols. Accordingly, the overall receiver performance will be improved 
compared with the scheme where the channel is estimated only once using the pilot 
symbols before the first turbo iteration. 

[0038] 3 Background on Sequential Monte Carlo 

[0039] Sequential Monte Carlo (SMC) is a class of eflRcient methods to obtain 
random samples fi-om a sequence of probability distributions known only up to a 
normalizing constant. A general fi^amework for the SMC methods is briefly explained next. 
Consider the following generic sequence of the a posteriori probability distributions {p 
{Xk\Yk)}k>Q where = (x<?, x/, . . . , Xk) is the set of imobserved parameters to be estimated 
and Yk = (yo, j^;, . . . , yi^ is the set of available observations at index k. We emphasize the 
fact that k is not necessarily a time index. Typically a posteriori distributions are only 
known up to a normalizing constant as p{Yk) = J p{Yk^i) p(Xk)dXk is not available in closed 
form and thus SMC is of great interest in this context. Assume one wants to compute the 
minimum mean-square error (MMSE) estimate of h (Xk\ which is given by 

E {h (Xfc) I Yfe} = jh (Xf,) p (X, I Kfc) dXfe. (14) 



_ k f ^ 

Given m random samples distributed according to p(Xk\ Yk), this expectation 

can be approximated numerically through 

E{hiXu)\Y,} = -'£h{x^^). (15) 

Sampling directly from piXk\Yk) is often not feasible and it is necessary to derive 
approximate methods. 
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[0040] 3.1 Importance Sampling 

[0041] Sampling directly from p(Xk\ Yk) is often not feasible or too 
computationally expensive, but drawing samples from some trial density "close" to the 

distribution of interest is often easy. In this case, we can use the idea of importance 

, m 



sampling. Suppose a set of random samples distributed according to q (Xk\ 

Yk) is available. We can still come up with vaUd estimates of (14) by correcting for the 
discrepancy between p(Xk\ Yk) and g{Xk\ Yk). By associating the importance weight 



to the sample , we can indeed estimate (14) as 



with The pair ^ ^ ^"^^ is called a properly weighted sample with 

respect to the distribution p(Xk| Yk). It is crucial to notice that one only needs to compute 
the weights up to a normalizing constant so that the knowledge of p(Yk) is not necessary. 
In the "optimal" case where q(Xk| Yk) = p(Xk| Yk), then all the weights are equal and have 
zero variance. Roughly speaking, the performance of the method typically deteriorates 
when the variance of the weights increases. 

[0042] The importance sampling method is a generic method that is not 
sequential. However, one may come up with a sequential version of it. Suppose a set of 

properly weighted samples with respect to p(Xk-\\ Yk~\) is available at 

index k-L Based on these samples, we want to generate a new set of samples 
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^ properly weighted with respect to p(Xk\ Yk). The Sequential Importance 
Sampling (SIS) algorithm proceeds as follows. 

• Draw a sample x^^k from a trial distribution ^ ^ and let 

• Update the importance weight 



9(^ain-,)9(^^«|jfi'i,n) 



[0043] One can check that the above algorithm indeed generates properly 
weighted samples with respect to the distribution p{Xk\ Yk). It is indeed simply an 
importance sampling method with trial distribution satisfying 

q {Xk \Yk)^q (Xk-i \ Yk^i) q (xk f Xk^u Yk) . 

This procedure is very general but it is crucial to design a "good'' trial distribution to 
obtain good performance. It is easy to establish that the optimal importance distribution - 
optimal as it minimizes the conditional variance of the weights var {wk\Xk-i, Yk} [23]- is 
given by 

q{xk \Xk^i.Yk)^p{^k\Xk^^,Yk). (IT) 
In this case, the importance weight recursion is 

[0044] 3.2 Resampling 

[0045] The importance weight measures the "quality*' of the corresponding 
imputed signal sequence A^^it • A relatively small weight implies that the sample is drawn 
far from the main body of the a posteriori distribution and has a small contribution in the 
final estimation. Such a sample is said to be ineffective. Whatever being the importance 
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distribution used, the SIS algorithm is getting inefficient as k increases. This is because the 
discrepancy between q(Xk\ Yk) and p(Xk\ Yk) can only increase with k. In practice, after a 
few steps of the procedure, only one stream dominates all the others; i.e., its importance 
weight is close to 1 whereas the others are close to 0. To make the SIS procedure efficient 
in practice, it is necessary to use a resampling procedure as suggested in [24]. Roughly 
speaking, the aim of resampling is to duplicate the streams with large importance weights 
while eliminating the streams with small ones; i.e., one focuses the computational efforts 
in the promising zones of the space. Each sample is copied kj times with 
k: = m 

j^i 3 * Many resampling procedures have been proposed in the literature [11, 14, 
15, 24, 25]. We use here the systematic resampling procedure proposed in [25]. 

• Compute the cumulative distribution po = 0, pj = pj-\ + /Wk fory = 1, . . . , aw. 

• Draw a random number U uniformly in [0, 1], Compute Uj = (U+jJ/m with 7 = 0,. 
. . , m - 1. 



• Assign uniform weights * ~ to the new set of streams *^ 
[0046] This algorithm is very computationally efficient. In the class of unbiased 

resampling schemes, i.e., E {kj} = mw^Vi, this algorithm minimizes the variance var {kJ 

and displays better performance than other algorithms. 

[0047] Note that if resampling is used in conjunction with the optimal 

importance distribution (17), one should perform the resampling step before the sampling 

step at time k as, in this very specific case, the importance weight at time k is independent 

of the samples . 

[0048] Resampling can be done at every fixed-length time interval (say, every 
five time steps or otherwise periodically) or it can be conducted dynamically. The effective 
sample size is a criterion that can be used to monitor the variation of the importance 
weights of the sample streams, and to decide when to resample as the system evolves. The 
effective sample size is defined as 



• SetKj={#i € fO, ...,m - i; ;py-i < C// <pj} . 
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where Vk, the coefficient of variation, is given by 
m^\w* ) " 

with ^i=i * * Roughly speaking, the effective sample size is a measure of 

variation over the weights. In the optimal case, where all the weights are equal, then it is 
maximum and equal to m. When one weight dominates all the others, then it is very small. 

In dynamic resampling, a resampling step is performed once the effective sample size 
is below a certain threshold. An alternative criterion such as the entropy of the weights 
could also be used. 

[0049] Heuristically, resampling can provide chances for good sample streams 
to amplify themselves and hence "rejuvenate" the samples to produce better results in the 
next steps. It can be shown that if m is sufficientiy large the resampled streams drawn by 
the above resampling scheme are also properly weighted with respect to p(Xk\ Yk). In 
practice, when small to modest m is used (We use m = 64 here), the resampling procedure 
can be seen as a trade-off between the bias and the variance. That is, the new samples with 
their weights resulting from the resampling procedure are only approximately proper, 
which introduces small bias in the Monte Carlo estimates. On the other hand, resampling 
significantly reduces the Monte Carlo variance for the future samples. 

[0050] 3.3 An Alternative Deterministic Procedure 

[0051] SMC is a very general set of methods to sample probability distributions 
on any space one wants. We restrict ourselves to a case of great interest in 
telecommunications; namely the case where Xk can only take values in a finite set say X, In 
this case flie a posteriori distribution p(Xk\ Yk) could be computed exactly but it is typically 
far too computationally expensive when k is large as Xk can take \X\^ possible values where 
\X\ is the number of elements in X, 
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[0052] In this very specific case, the SIS procedxire with the optimal 
importance distribution (17) takes the following fomi. 

• Draw a sample x^^k from p(xk|X<^Vh Yk) and let = (X^Vi, x\y 

• Update the importance weight 

[0053] The importance weight is computed using 

p(yk I Xk^j.Yk^j) --J^pivkl Xk,Yk^x)p(xk\Xk^i). (18) 

Two points can be criticized about this algorithm. First, assume that the number of streams 
m is equal to \X\^ with I <n,(lfl>k then one coxild compute p(Xk\ Yk) exactly). The 
algorithm should enumerate the first possibiUties and compute their a posteriori 
probabilities; i.e., one should compute p{Xi\ Yi) exactly. This can be easily incorporated in 
the initialization phase of an SMC algorithm. Second, one can see that the calculation of 
the weights (18) involves computing the a posteriori distribution up to a normalizing 
constant of m\X\ streams. Indeed for each A^Vi, one needs to compute p(yk\X^\-h Xh Yk^\) 
for Xk e X. By sampling this information is somehow discarded. An alternative 
deterministic approach consists of keeping among the /w|Al "candidate" trajectories at time 

{x^l^^xkExY' , 

At, ^ the w trajectories with the Wghest a j905ter/on distribution. This 

algorithm can be interpreted as an extended Viterbi algorithm with m survivors. Clearly the 
local approximation error one commits at the selection step using this deterministic 
strategy is lower than using the randomized strategy. Nevertheless it does not guarantee 
performance to be better overall. 

[0054] To sum up, given ^ ^ with a posteriori distributions (known up 
to a normalizing constant) ^ the algorithm proceeds as follows at time k. 
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• Compute 

n^M oc u^2x -piVkl .YgpXfc,n-i),Xjk € X. (19) 

• Select and preserve only the m "best" distinct streams amongst the /w^X] 
hypotheses. 

[0055] The drawback of this approach is that clearly the past observations are 
as important as the current observation in the calculation of the weight. If an error is 
committed then it has a significant impact on further decisions. The randomized algorithm 
is less sensitive to this problem as when the streams are resampled their weights are set to 
l/m. 

[0056] It is worth noticing that both the stochastic and deterministic algorithms 
are well suited for parallel implementation, 

[0057] 4 Soft MDVIO Demodulation Algorithms 
[0058] In this section, we will derive soft-input soft-output MIMO 
demodulation algorithms based on the sequential Monte Carlo principle. The importance 
sampling density is obtained by utilizing the artificial sequential structure of the existing 
simple nulling and cancellation BLAST detection scheme. First, the channel parameters 
are assumed to have been estimated perfectly at the receiver through the short training 
sequence before detection. Later, we will examine the effect of channel estimation error on 
systems performance with pilot symbols along with high-quality symbol decision 
feedback. 

[0059] 4.1 Simple Nulling and Cancellation BLAST Detection Algorithm 
[0060] Consider the signal model (2), we denote the QR-decomposition of O^^^ 

as 

Q^^^=QR, (20) 
where g is a imitary matrix and 1? is an upper triangular matrix. The nuUing operation is a 
coordinate rotation that left-multiplies the vector u on (2) by to produce a sufficient 
statistic, 
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z =Q"u = J^Rs + v, 



(21) 



where Q^v. gj^gg q jg vinitary, there is no noise enhancement and the noise 
whitening characteristic maintains by nulHng, i.e., ^ ~ ^«t)- rewrite (21) 



as 



-1 




^2 


= /A 




y ivp 









Si 
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R 



■V" 



(22) 



We can detect data symbols directly by nulling operation, i.e., multiply z by R'^ However 
in [5], it is shown that by utilizing the upper triangular structure of R, significant 
improvement over zero-forcing can be obtained with the following successive interference 
cancellation method: 



— )' 



(23) 



where Z,(x) = arg minyie^(|x - ^/|). Although the above simple nulling and cancellation 
scheme has a very low complexity, its performance is much worse than that of the methods 
based on zero forcing or MMSE nulling and interference cancellation with ordering as well 
as that of the sphere decoding algorithm (See Fig. 5). In what follows, we develop SMC- 
based MIMO demodulation algorithms using the above simple nulling and cancellation 
method as the kernel. As will be seen in Section 5, these new algorithms provide near- 
optimal performance in both uncoded and coded MIMO systems. 
[0061] 4.2 Stochastic SMC MEMO Demodulator 
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[0062] As from (22), the artificial sequential structure of the simple nulling and 
interference cancellation scheme due to the upper-triangular structure is well suited for 
applying the SMC method to MIMO data detection with the particularity of operating on 
spatial domain starting from antenna nr to antenna 1. Indeed one has 

ITT 

P{s I u)^ P{s I z) oc llp{^k I S,)P{Sk). (24) 
where = f^^* " * ' ^^t) = i^k^ ^nrl We can thus use SMC methods to 

simulate from the sequence of probability distributions "«t»«t 
sequence of "artificial" distributions is defined by 

P (Skl Zk) oc l[p (zi I St) P (si) . (25) 

[0063] The aim of SMC MIMO detection is to compute an estimate of the a 
posteriori symbol probability 

P(3k = ai\^), ai€A, k = l,...,i^, (26) 



Let V / be J 



based on the received signal z after nulling. Let ^ ^ be a 

sample drawn by the SMC at each symbol interval, where m is the number of samples. In 
order to implement the SMC, we need to obtain a set of Monte Carlo samples of the 
transmitted symbols {s^\ , w^\}properly weighted with respect to the distribution of p(s\z). 
In the application of MIMO demodulation, the function h (•) in (15) is specified by the 
indicator function 1( ) as 

Hence, the a posteriori probability of the information symbol Sk can then be estimated as 
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P{Bk = ai\z) = B{l{sk = Oi) \ z) 

where 

[0064] Following ( 1 7) in Section 3. 1 , we choose the trial distribution as: 
9 (sfe I Zk> oc P (sk I ifc,^i) . (29) 
For this trial distribution, the importance weight is updated according to 

"'ib - ^k+i 



oc (30) 

We next specify the computation of the predictive distribution in (29) and (30). First, we 
consider the trial distributions in (29) 

q(sk\Zk,^l^ a p{zk\Bk,Zk+i,S^^'P{sk) (31) 

Since the noise » in (21) is white Gaussian i.e., ^ ~ -^^0(0, /«,,)) -^q have 

)~-^c(/ig,l), (32) 

where the mean is given by 

''i!i = (v'l£;M^?' + y/Jr.,a.). (33) 



We denote 
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ttfci - p{^k\Zk+i,S^Jli,3k = ai) P(8k = ai) (34) 
= 7exp|-||^fc-Mg]f|p(sfc = Oi). (35) 

Hence, the predictive distribution in (30) is given by 

I] P (^A I , Sk = Oi) P{sk = Oi) 

= E"g (36) 

[0065] Referring to Fig. 3, we summarize the stochastic SMC MIMO 
demodulation process, which begins at block 300, as follows: 

0. Initialization: all importance weights are initialized as w^^-i = 1,7 = 1, . . . , m 
(block 305). 

At block 310, the following steps are implemented at the ^-th recursion (k=nT, nr 
- 1, ... , 1) to update each weighted sample. For y = 1, . . . , m: 

1. At block 315, compute the trial sampling density a^\/ for each a,- sA according 
to (35) with the a priori symbol probability P(sk = ai) obtained from the last turbo iteration, 
calculated using (4). 

2. At block 320, draw a sample from the set A with probability 

p(% = a,.|s|5^,Zfc) a ag], o^eA (37) 

3. At block 325, compute the importance weight 

oc w^rEo^l (38) 

4. At block 330, compute the a posteriori probability of the information symbol Sk 
according to (28). 
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5. At block 335, the process is repeated for the next sample in the recursion until 
the last (j=tn) sample is reached. At block 340, resampling may be performed as described 
in Section 3.2. Note that the resampling does not necessarily occur in the sequence 
indicated in Fig. 3. At block 345, the process is repeated for the next recursion until the 
last recursion (k=l) is reached. At block 350, the process ends. 

[0066] The stochastic and deterministic demodulation processes of the 
invention may be implemented using any known hardware, firmware and/or software. For 
example, the demodulator 210 may include an ASIC that is programmed to perform the 
desired calculations. A program storage device such as a memory may tangibly embody a 
program of instructions that are executable by a machine such as a microprocessor to 
perform the demodulation. The program storage device and microprocessor may be part 
of, or otherwise associated with, the demodulator 210 and possibly other components in 
the receiver 200. Moreover, the functionaUty of any of the components in the receiver 200 
may be implemented similarly. 

[0067] 43 Deterministic SMC MIMO Demodulator 

[0068] The deterministic method for estimating the sequence of probability 

distributions ^ ^ ^ proceeds as follows. Similarly to the stochastic 

case, we assimie that m samples are drawn at each iteration where m=\A\^ with I <nT. 
Note that if / = , it is equivalent to the maximum likelihood detection and involves 
l-^l^ calculations. In the exploration step, we compute exactly the distribution 

^ nr i+i I nr enumerating all m particles, e.g., samples, for antenna nj-down 

to antenna «r -/ +1 . As a result, we have a set of m distinct QPSK sequences 

^"^^ With weights ^ ^^-^ satisfying 
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4Ul = ^(^-/+il^«r-l+l) 



n f&. 



(39) 



where 



is given by (33). 

[0069] The second step is the extended Viterbi algorithm as described in 
Section 3.3 performed from antenna nr- I to antenna 1 (i.e., ^ = wr - /, . . . , 1). We need 
to update the importance weight according to 

(42) 

where the m distinct QPSK symbol sequences with highest importance weights are 
selected as the survivor paths over m\A\ hypotheses. The deterministic SMC MIMO 
demodulation algorithm, which begins at block 400, is summarized in Fig. 4 as follows: 

0. Initialization: for A: = , . . . , - / + 1, calculate the exact expression of the 
probability distribution via (41) by enumerating all m particles (samples) (block 405). 

At block 410, the following steps are implemented at the Mi recxirsion {k=nT- I, 
nr - I - 1, . . . , 1) to update each weight sample. For y = 1, . . . , m: 

1 . At block 415, compute the importance weight 

(43) 

where ;8^\, is given by (41). 
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2. At block 420, select and preserve only the m "best" distinct streams * with the 
highest weights amongst the m\A\ hypotheses with weights set 

3. At block 425, compute the a posteriori probability of the information symbol Sk 
according to (28). 

At block 430, the process is repeated for the next sample in the recursion imtil the 
last 0=ni) sample is reached. At block 435, the process is repeated for the next recursion 
until the last recursion (k= «r - / + 1) is reached. At block 440, the process ends. 

[0070] 5 Simulation Results 

[0071] In this section, we provide computer simulation results to illustrate the 
performance of the proposed turbo receivers in flat-fading MIMO channels with nT=nR = 

8. The fading coefficients are generated according to (0. l)^ ^here i.i.d. denotes 

"independent, identically distributed". Channels are assumed to be quasi-static, i.e., they 
remain constant over the entire frame of N symbols, but vary from frame to frame. 
Simulation results are obtained by averaging over 500 channel realizations. The number of 
samples drawn in the stochastic SMC algorithm is m = 64. The number of samples drawn 
in the deterministic algorithm is m = 64 (i.e., / = 3) in the uncoded case and m=\6 (i.e., / = 
2) in the coded case. 

[0072] 5.1 Performance in Uncoded MEMO Systems 
[0073] We first illustrate the performance of the proposed stochastic and 
deterministic SMC MIMO demodulation algorithms in an uncoded MIMO system. The 
BER performance of these two new algorithms, together with some existing detection 
algorithms (including the sphere decoding algorithm, the method based on MMSE nulling 
and cancellation with ordering, the method based on zero-forcing nulling and cancellation 
with ordering, as well as the simple nulling and cancellation method) are shown in Fig. 5. 
QPSK modulation is used. Several observations are in order. First of all, the simple 
nulling and cancellation method displays very poor performance. However, when it is 
combined with SMC (deterministic or stochastic), we obtain a much more powerfiil MIMO 
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detector. Secondly, the deterministic detector outperforms the stochastic SMC detector and 
it actually slightly outperforms the sphere decoding algorithms, which is the best 
suboptimal MIMO detector known so far! 

[0074] 5.2 Performance in Coded MIMO Systems 

[0075] For the coded MIMO system, a rate- 1/2 constraint length-5 
convolutional code (with generators 23 and 35 in octal notation) is employed at the 
transmitter. The bit interleaver is randomly generated and fixed throughout the simulations. 
The code bit block size is 5 12, corresponding to 256 information bits and 32 QPSK MIMO 
symbols. The channel estimator employed here is the MMSE MIMO channel estimator 
given by (12), where T= nr= 8, and QPSK orthogonal MIMO pilot symbols are used. The 
number of turbo iterations in each simulation run is four. 

[0076] In Figs. 6, 7 and 8, the BER performance of the turbo receiver 
employing the stochastic SMC demodulator is plotted. "Iter" denotes "iteration", "ch" 
denotes "channel" and "est." denotes "estimated". The solid curves in these figures 
correspond to the BER performance with perfectly known channel; whereas the dashed 
curves correspond to the BER performance with different channel estimation schemes. In 
Fig. 6, the channel is estimated only once based on the pilots and used throughout all turbo 
iterations. In Fig. 7, the channel is re-estimated at the beginning of each turbo iteration 
using both the pilots and the decoded symbols with high-reliability, as discussed in Section 
2.3. And in Fig. 8, we assume the channel is estimated by a genie that knows both the pilot 
symbols and all information symbols. Such a genie aid channel estimate is used throughout 
all turbo iterations. The scenario for Fig. 8 provides an upper bound on the achievable 
performance for the turbo receivers with different channel estimation schemes. It is seen 
that by performing iterative channel estimation and symbol detection, tiie turbo receiver 
offers a performance approaching the genie-aided bound. 

[0077] The corresponding BER performance of a turbo receiver employing the 
deterministic SMC demodulator is shown in Figs. 9, 10 and 1 1. It is seen that the 
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deterministic SMC algorithm offers improved and more stable performance than its 
stochastic counterpart. 

[0078] 6 Conclusions 

[0079] The invention provides a new family of demodulation algorithms 
particularly useful for soft-input soft-output MIMO demodulation. These new techniques 
illustratively use the conventional BLAST detection based on simple nulling and 
cancellation as the Icemel, and are based on the sequential Monte Carlo (SMC) method for 
Bayesian inference. Two versions of such SMC MIMO demodulation algorithms are 
developed, based on respectively stochastic and deterministic sampling. As hard MIMO 
detection algorithms, the proposed SMC demodulation algorithms significantly outperform 
all existing BLAST detection methods. Moreover, the deterministic SMC MIMO detector 
slightly outperforms the sphere decoding algorithm. Furthermore, in coded MIMO 
systems, the proposed SMC algorithms can naturally serve as the soft MIMO demodulator 
in a turbo receiver. Simulation results indicate that overall the deterministic SMC MIMO 
demodulator offers better and more stable performance compared with its stochastic 
coxmterpart in both the uncoded and the coded MIMO systems. 

[0080] The invention has been described herein with reference to particular 
exemplary embodiments. Certain alterations and modifications may be apparent to those 
skilled in the art, without departing from the scope of the invention. The exemplary 
embodiments are meant to be illustrative, not limiting of the scope of the invention, which 
is defined by the appended claims. 
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